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Abstract 

Previous genetic studies of eastern coyotes (Canis latrans) are based on one of 
two strategies: sannpling nnany individuals using one or very few nnolecular 
nnarkers, or sannpling very few individuals using nnany genonnic nnarkers. Thus, 
a regional analysis of genetic diversity and population structure in eastern 
coyotes using nnany sannples and several nnolecular nnarkers is lacking. I 
evaluated genetic diversity and population structure in 385 northeastern 
coyotes using 16 connnnon single nucleotide polynnorphisnns (SNPs). A 
region-wide analysis of population structure revealed three prinnary genetic 
populations, but these do not correspond to the sanne three subdivisions 
inferred in a previous analysis of nnitochondrial DNA sequences. More focused 
geographic analyses of population structure indicated that annple genetic 
structure occurs in coyotes fronn an internnediate contact zone where two range 
expansion fronts nneet. These results dennonstrate that genotyping several 
highly heterozygous SNPs in a large, geographically dense sannple is an 
effective way to detect cryptic population genetic structure. The innportance of 
SNPs in studies of population and wildlife genonnics is rapidly increasing; this 
study adds to the growing body of recent literature that dennonstrates the utility 
of SNPs ascertained fronn a nnodel organisnn for evolutionary inference in 
closely related species. 
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Introduction 

Historically restricted to the open deserts and plains of central and 
western North America, the coyote (Canis latrans) has colonized 
almost the whole continent in the last 100 years, with few excep- 
tions {e.g. Long Island) The eastward range expansion was likely 
facilitated by widespread deforestation associated with agricultural 
development in the early 20^*" century and by the near extirpation 
of eastern wolves {Canis lupus lycaon or Canis lycaon) and red 
wolves {Canis rufusf'^. The range expansion into northeastern 
North America advanced as two primary colonization fronts: the 
northern route through Ontario, where coyotes hybridized with res- 
ident eastern wolves, and the southern route through Ohio, where 
wolves were eradicated prior to coyote expansion- Along with 
its geographic range expansion, the coyote also experienced a niche 
expansion by rapidly colonizing whole new biomes, including east- 
ern temperate and boreal forests ^ This complex scenario of colo- 
nization provides an interesting opportunity to explore the swift 
formation of population genetic structure following a rapid expan- 
sion in geographic distribution and ecological niche. 

Recent analyses of population structure in northeastern coyotes 
have described a general lack of genetic differentiation among sam- 
pling localities, except at the coarsest geographic scales. Way et aV 
examined genetic variation and structure in a sample of coyotes 
from eastern Massachusetts using mitochondrial DNA (mtDNA) 
and eight microsatellite loci. They found no genetic structure in 
coyotes within Massachusetts or even within the broader region of 
northeastern North America. Instead, northeastern coyotes seemed 
to constitute one uniform population slightly differentiated from 
western coyotes. In another analysis of genetic variation in north- 
eastern coyotes, Kays et aV identified three coarse phylogeographic 
areas: Ohio, the northeast zone, and a contact zone in western 
Pennsylvania and New York where the colonization front from 
Ohio has spread into the northeastern population. Although Kays 
et al. surveyed genetic variation in a dense geographic sample of 
687 coyotes, they only used one genetic marker, the hypervariable 
mtDNA control region. vonHoldt et al.^ conducted a genome- wide 
analysis of North American Canis species and detected population 
structure in C. latrans, but only at the broadest continental scale. 
Although vonHoldt et al. genotyped tens of thousands of loci, they 
only sampled 14 northeastern coyotes, making detection of finer 
levels of population structure in the region very improbable. Thus, 
all the previous studies of population structure in northeastern 
coyotes have adopted one of two strategies: sampling many indi- 
viduals using one (mtDNA) or very few molecular markers, or sam- 
pling very few individuals using thousands of genomic markers. A 
regional analysis of genetic diversity and population structure in 
northeastern coyotes using many samples and many nuclear molecu- 
lar markers is currently lacking. 

Genetic structure is a ubiquitous property of natural, domesticated, 
and human populations. Population genetic structure plays consid- 
erable roles in evolution, as both the basis and the consequence of 
local adaptation, the splitting of one species into two if the envi- 
ronments are markedly different, and the adaptability of a species 
as a whole across its range {i.e., transformation rather than spe- 
ciation)'^. The detection of genetic structure largely depends on the 



type and number of molecular markers examined, their variability 
in the target population, the number of individuals sampled, and 
the spatial sampling scheme "^'^^ Single nucleotide polymorphisms 
(SNPs) have become a popular and inexpensive tool in the field 
of molecular population genetics. SNPs have properties that make 
them a superior alternative to other widely used genetic markers, 
such as microsatellites and mtDNA sequences, in evaluating genetic 
diversity and population structure Furthermore, the sampling 
scheme can greatly influence the location and composition of 
genetic clusters, especially in species that are continuously distrib- 
uted across a landscape^ ^ as are northeastern coyotes. Thus the con- 
clusion of no population structure from previous analyses based on 
limited individual or genomic sampling may be imprecise. 

The objective of this study was to test the hypothesis that fine- scale 
population structure in northeastern coyotes exists, but remains 
undetected due to the small number of individuals or the low reso- 
lution of the genetic markers previously analyzed. I hypothesized 
that population structure would be detectable, at finer levels than in 
previous analyses, by using an array of 16 high-heterozygosity 
nuclear SNPs ascertained from the dog genome and a spatially dense 
sample of 385 coyotes. This is the first comprehensive regional 
survey of genetic diversity and population structure in northeastern 
coyotes that uses a dense geographic sampling scheme and several 
SNPs. This regional analysis reveals a cryptic population structure 
and a geographic pattern of nuclear genetic diversity that is discrep- 
ant with previous mtDNA- and microsatellite-based surveys. More 
generally, this study adds to the growing body of recent literature 
that demonstrates the utility of SNPs discovered in a model organ- 
ism for evolutionary inference in wild relatives, as long as ascer- 
tainment bias is explicitly evaluated. 

Materials and methods 

Study area and sampling 

The study area was located in northeastern North America. Coyotes 
were sampled from New York (N = 174), Pennsylvania (N = 103), 
Vermont (N = 34), Ohio (N = 30), New Jersey (N = 14), New 
Hampshire (N = 11), Connecticut (N = 8), Massachusetts (N = 5), 
southern Quebec (N = 4), and Rhode Island (N = 2) (Figure 1). 
All samples (Total N = 385) used in this study are archived and 
vouchered in the New York State Museum, Albany, NY, where they 
were kept at -80°C. Specimens (tissues or combination of skin, 
skull, skeleton) were obtained primarily through donations from 
licensed local hunters and trappers since 1999. Six samples came 
from previous scat surveys in New York^^'^^' (Data File). No Insti- 
tutional Animal Care and Use Committee (lACUC) review was 
required for this study because the DNA samples came from scat or 
animals killed for reasons other than research. 

Marker selection and laboratory methods 

I selected molecular markers based on a genomics study that used 
the Affymetrix Canine Mapping Array to genotype 61,435 SNP 
loci in hundreds of wild and domestic canids, including 14 north- 
eastern and 3 Ohio coyotes^'^^. I used the program PLINK^^ to 
compute observed and expected heterozygosity per locus in the 
subset of 17 northeastern and Ohio coyotes. I selected 16 unlinked 
SNPs, each on a different autosomal chromosome, with the highest 
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Figure 1. Study area and sampling localities of 385 coyotes from northeastern United States and southeastern Canada. Symbol 
shapes represent phylogeographic zones as in Kays et al.^: circle, Ohio; square, contact zone; triangle, northeast zone. 



observed heterozygosity, a measure of genetic variability in a popu- 
lation (Table 1). Ascertainment schemes that select highly variable 
SNP loci increase power to detect population structure^'; accord- 
ingly, these 16 informative SNPs with high heterozygosity and high 
minor allele frequencies allowed me to assess genetic variation and 
population structure in the larger target sample. I designed prim- 
ers using the PrimerS software and tested them in silico against 
the dog CanFam2 genome assembly (GenBank Assembly ID: 
GCA_000002285.1)2" using the University of California, Santa 
Cruz In-Silico PCR and BLAT web tools (http://genome.ucsc.edu). 

I extracted total genomic DNA from muscle or hide using the 
DNeasy Blood and Tissue Kit (Qiagen, Valencia, USA) according 
to the manufacturer's instructions, and also used DNA samples that 
were extracted in the New York State Museum as described in Kays 
et alW quantified DNA concentrations using a NanoDrop ND- 1000 
Spectrophotometer (Thermo Scientific, Wilmington, USA) and 
diluted the samples in water to attain concentrations of 5-30 ng/jal. 
I prepared four 96-well plates with template genomic DNA of 378 
eastern coyotes, seven of which served as controls because they 
were already genotyped with the canine SNP microarray^ Two 



western coyotes from Washington and two western gray wolves 
{Canis lupus) from Yellowstone National Park served as addi- 
tional controls; two wells containing only water served as negative 
controls. 

SNP genotyping was performed at the GenoSeq Core laboratory 
in the University of California, Los Angeles, using a high resolu- 
tion melting curve quantitative PCR assay on a LightCycler 480 
thermal cycler (Roche, Indianapolis, USA). Each of the 16 SNPs 
was amplified and genotyped separately. DNA was amplified in a 
total volume of 10 |il, including 1 jal (5-30 ng) of genomic DNA, 
0.2 |iM of each primer, 4.2 mM MgCl^, and Ix Roche High Res- 
olution Melting Master kit mix. The latter contains FastStart Taq 
DNA polymerase, dNTP mix, and ResoLight, a high resolution 
melting dye that fluoresces when DNA is double- stranded. During 
the melt curve analysis, the temperature increases very slowly to 
denature double- stranded DNA. Samples with variations in DNA 
sequence, even in one base pair, are distinguished by discrepancies 
in the shape of the melt curve, thus discriminating each of the two 
homozygous and the heterozygous genotypes. I processed the raw 
data using the Gene Scanning module of the Roche LightCycler 
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Table 1. Sixteen single nucleotide polymorphisms (SNPs) and their primer pairs for high-resolution melt curve genotyping assay. 

SNP ID chromosomal coordinates and dbSNP rs numbers correspond to the CanFam2 dog genome assembly. Measures 
of genetic variability correspond to the initial seventeen northeastern and Ohio coyotes {Cam's latrans) that formed the 
ascertainment panel. 



SNP ID 


dbSNP rs# 


Forward primer 


Reverse primer 


PGR 
product 
size (bp) 


MAF 


Ho 


He 


chr1:92426160 


rs21 988674 


GGGGTTTCTGAAGTGCTGAC 


TGTGATAGCCACAGAAAAGCA 


92 


0.382 


0.765 


0.472 


chr3:601 34962 


rs8795212 


CACTGAGGAATGCTGGGAAG 


TCAGGAAGTCTACTCCAGTGTCTG 


80 


0.441 


0.765 


0.493 


chr4:33800600 


rs24071674 


ATCACCTCCAGAAAGCCAAG 


TAAGGATCATCCCCTCGTTC 


95 


0.382 


0.765 


0.472 


chr5:65740765 


rs24241051 


GGACCTCCATAGGACATCCA 


TGTGTGGGGAGATGCAAAT 


97 


0.500 


0.765 


0.500 


chr6:17110138 


rs24352476 


CAGTCACAATGGGGTGTCAG 


AAGCGGGAGGTAGTATTACTGGT 


97 


0.441 


0.765 


0.493 


chr8:69650155 


rs245 14604 


G CTC CTG G CTATTG TATTTTC C 


TTC AATTCTG C ATG GTTG GT 


99 


0.441 


0.765 


0.493 


chr1 0:45343436 


rs22055760 


TCTTTGAGGACATGGAACGA 


TCACTCTGGAGACCAAGACG 


94 


0.500 


0.765 


0.500 


chr11:66863044 


rs8946304 


TGGGTAATTTAATCAACGAGGAA 


AAAAGCAAGAGGAGGGAACC 


92 


0.441 


0.765 


0.493 


chr12:17166054 


rs221 84574 


CAACGGCTGGATTCTGACTA 


GCACACTGGTGTAGCAGAGC 


118 


0.441 


0.765 


0.493 


chr16:9533917 


rs22444520 


TTGATAAATCAAAACCTGGGATG 


GATCTGGCCCACAGCTCA 


96 


0.441 


0.765 


0.493 


Chr1 7:31 508687 


rs22603056 


CAAAAATCAGGGATACAGACAAG 


GCCAGAGAATGCCATCTTTA 


100 


0.471 


0.824 


0.498 


chr1 9:5061 8604 


rs22758397 


TTTTTCCCTGCCTGATTTTT 


TTGGAAAGAGATGTCAAGATGG 


92 


0.441 


0.765 


0.493 


Chr22:57259397 


rs23051971 


GTAGAGGACACCCTTAGATGTGG 


TGTCTGGAGGGAGTTCAACA 


95 


0.500 


0.765 


0.500 


Chr25:44793770 


rs23209441 


TGACTCACCCAAGGTGATATG 


CAGCTCTGATCATGCCAAAT 


100 


0.471 


0.824 


0.498 


chr27:5811313 


rs23365246 


AATCACACACGAGCAACACC 


CTGCTTGTCCTGGGATGAA 


96 


0.471 


0.824 


0.498 


Chr37:26421162 


rs9205317 


GGCTCCCAGCTAACTGTTCA 


AGCTATCCAGAAGCCCAAGAG 


93 


0.471 


0.824 


0.498 



Note: MAF: minor allele frequency; and H^: observed and expected heterozygosity. Several SNPs have the same MAF, H^, and due to the relatively small 
number of coyotes in the ascertainment panel. For example, = 0.765 if 13 of 17 individuals were heterozygous. 



software and followed Roche's recommendations for evaluating 
data quality. The software automatically generates genotypes from 
the raw melt curve data. All SNPs were biallelic, so there are three 
possible genotypes per locus-AA, AB and BB-each forming three 
distinct clusters when signal intensity is plotted against tempera- 
ture. I visually inspected all software-generated genotype calls and 
manually removed the ambiguous ones that did not conform to any 
of the three possible genotype clusters, 

Analyses of genetic diversity and population structure 

To the 378 samples genotyped in this study, I added 7 more north- 
eastern coyote samples genotyped by vonHoldt et a/. ^ making a total 
sample size of 385. 1 used PLINK to calculate average observed and 
expected heterozygosity, measures of genetic diversity, and devia- 
tions from Hardy- Weinberg equilibrium in the overall sample and 
in each of the three phylogeographic zones inferred by Kays et al.^ 
(Figure 1). In order to assess the ascertainment bias of using dog 
genome SNPs to study genetic diversity and population structure 
in coyotes, I compared genetic variation in five subsets of canids: 
northeastern coyotes, western coyotes, western gray wolves. Great 
Lakes wolves, and dogs (Canis familiarisY'^^ I calculated average 
observed and expected heterozygosity using all 61,435 SNPs from 
the microarray because the vast majority of the loci in the microar- 
ray were ascertained by dog-dog comparisons^^ and again using 
only the 16 high-heterozygosity SNPs genotyped in this study. 



To assess population genetic structure, I excluded all individuals 
with more than five missing genotypes, resulting in 247 coyotes (16 
in Ohio, 118 in contact zone, 1 13 in northeast zone). I used the pro- 
gram STRUCTURE 2.3-^ to infer the most likely number of genetic 
populations. STRUCTURE implements a Bayesian algorithm to 
assign multilocus genotypes to genetic clusters by calculating the 
likelihood that a group of individuals constitutes a population. I 
tested whether finer population structure was detectable with the 
high-heterozygosity SNPs relative to the coarse structure detected 
with hypervariable mtDNA. I analyzed all 247 coyotes together 
to test whether more than three populations were detectable at 
the regional level, and then analyzed each zone separately to test 
whether more than one population was detectable within each zone. 
For all analyses, I used three replicate runs of 20,000 burn-in and 
100,000 Markov chain Monte Carlo iterations, used the admixture 
ancestry model with correlated allele frequencies--, and set the 
number of populations from K=ltoK = 8.I verified that alpha 
and likelihood statistics reached convergence during the burn-in 
period for each number of populations analyzed. I used STRUC- 
TURE HARVESTER^^ to evaluate the relative support for each value 
of K by plotting Ln P(D), the mean posterior probability of the 
data-^ and AK, a quantity related to the second-order rate of change 
of the likelihood function with respect to I used CLUMPP^"^ 
to align and average the three replicate cluster membership coef- 
ficient matrices, and ArcMap 10 (Esri, Redlands, USA) to visualize 
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the spatial distribution of genetic structure. I considered individuals 
with ancestry coefficients q > 0.8 as belonging to a specific clus- 
ter and individuals with all q < 0.8 as being admixed, consistent 
with recent genetic investigations of Canis^^'^^\ In order to corrobo- 
rate inferences from the STRUCTURE analysis with a model-free 
approach, I conducted a principal components analysis (PCA) using 
the adegenet 1.3^ package^^ in R^l I also computed pairwise F^^, 
the inbreeding coefficient within populations relative to the total, 
among the populations inferred by STRUCTURE and tested the 
significance of the differentiation by analysis of molecular variance 
(AMOVA) using 999 permutations in GenAlEx 6S-'''''\ 

Results 

I interrogated 16 SNP loci in 378 coyotes for a total of 6,048 
expected genotypes (Data File). Genotyping efficacy varied by 
source of DNA, with fecal samples amplifying less effectively than 
tissue samples (Mann-Whitney test: U = 1771, P = 0.017). That 
is, the fecal samples had, on average, more missing or ambiguous 
genotypes than the tissue samples. 



the lowest levels of mitochondrial genetic diversity, these same 
individuals had a level of nuclear genetic diversity comparable to 
Ohio coyotes (Table 2). The most pronounced differences between 
observed and expected heterozygosity occurred in the contact zone 
and in the overall regional analysis (Table 2). In the overall sample of 
385 eastern coyotes, five loci were in Hardy- Weinberg equilibrium. 
The number of loci in Hardy -Weinberg equilibrium increased when 
each phylogeographic zone was analyzed separately: 13 in Ohio, 
nine in the contact zone, and nine in the northeast zone (Table 2). 

Dogs appeared to be the most genetically diverse when the diversity 
of the five different canid groups was estimated using all 61,435 
SNPs from the canine microarray. The genome-wide ascertain- 
ment bias was towards dogs: the expected heterozygosity of dogs 
was almost twice that of western coyotes. But the ascertainment 
bias reversed when heterozygosity was measured using only the 16 
selected SNPs: coyotes appeared to be the most genetically diverse, 
with northeastern coyotes having a very high expected heterozy- 
gosity, whereas dogs appeared the least genetically diverse (Table 3). 



The overall sample of 385 eastern coyotes had lower genetic diver- 
sity than expected (Table 2), even though observed heterozygosity 
generally exceeded expected heterozygosity in the ascertainment 
panel of 17 northeastern and Ohio coyotes (Table 1). Ohio coyotes 
were the most genetically diverse in the region, but the eastward 
decay in genetic diversity observed with mtDNA was not replicated 
with nuclear SNPs. Although coyotes from the northeast zone had 



In the region- wide population structure analysis of 247 individuals 
with little missing data, the values of K with the strongest statistical 
support were K = 2 and K = 3 (Figure 2 A, B). Pairwise F^^ among 
the three genetic clusters varied from 0.08 to 0.10 and all were sig- 
nificant in the AMOVA framework (P = 0.001). This indicates that 
there are three primary genetic subdivisions in the broad sampling 
area. The three groups did not correspond to the Ohio, contact, and 



Table 2. Genetic diversity of eastern coyotes {Canis latrans) measured with 
mtDNA sequences and 16 nuclear SNP genotypes. Most individuals genotyped 
at 16 nuclear SNPs represent a subset of those individuals sequenced. 



mtDNA control region High-heterozygosity SNPs 





N 


Haplotype 
diversity 


e (per site) 


N 


Ho 


He 


HWE 


Ohio 


30 


0.844 


0.018 


30 


0.465 


0.435 


13 


Contact 


207 


0.721 


0.014 


177 


0.312 


0.411 


9 


Northeast 


450 


0.664 


0.008 


178 


0.442 


0.457 


9 


Total 


687 


0.708 


0.013 


385 


0.388 


0.444 


5 



Note: mtDNA data and zone designations from Kays etal.^. N: sample size; and H^: 
observed and expected heterozygosity; HWE: number of loci in Hardy-Weinberg equilibrium. 



Table 3. Ascertainment bias of surveying genetic diversity in different 
groups of canids using SNPs discovered after completion of the dog 
genome assembly. 



Canid group 


N 


61,435 SNPs 


^ 16 high- ^ 
heterozygosity SNPs 




Ho 


He 


Ho 


He 


Northeastern coyotes 


14 


0.190 


0.202 


0.763 


0.493 


Western coyotes 


45 


0.147 


0.182 


0.387 


0.399 


Great Lakes wolves 


19 


0.187 


0.217 


0.278 


0.290 


Western gray wolves 


32 


0.203 


0.238 


0.271 


0.319 


Dogs 


50 


0.234 


0.359 


0.270 


0.387 



Note: Data for 61 ,435 SNPs from vonHoldt et al.^ N: sample size; and H^: 
observed and expected heterozygosity. 
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Figure 2. Regional population genetic structure of eastern coyotes. A) Estimation of the number of genetic populations using two criteria: 
AK (solid line) and Ln P(D) (dashed line). The most probable number of populations is the value of K with the maximal value of AK and Ln 
P(D). The data strongly support K = 2 and K = 3. B) STRUCTURE bar plots of N = 247 coyotes subdivided into K = 2 and K = 3 genetic 
populations; each individual is represented by a vertical bar partitioned into two or three colored segments indicating that individual's 
proportional membership in each of two or three genetic clusters. C) Spatial distribution of K = 3 genetic populations. Symbol colors represent 
the genetic cluster with > 80% assignment from panel B, or black if highly admixed (/. e., no assignment > 80%). Symbol shapes represent 
phylogeographic zones as in Figure 1 . Symbol locations are slightly jittered to display each individual and to reduce clutter. 
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northeast zones previously inferred by mtDNA (Figure 2C). There 
is some geographic structuring, but the three groups overlap exten- 
sively in space. Although the red cluster in Figure 2B includes most 
of the Ohio coyotes, it is more cosmopolitan, also including many 
coyotes from the contact and northeast zones. The green cluster is 
mostly restricted to the contact zone, but extends slightly into east- 
ern New York and Vermont. The PCA corroborated the results from 
STRUCTURE (Figure 3). The first two PCA axes explained 24.2% 
of the total variance and clearly separated the three STRUCTURE- 
inferred clusters. Most admixed individuals with no clear member- 
ship in any STRUCTURE-inferred cluster also showed no clear 
association with any PCA cluster. 

Ample population genetic structure was detected in the contact zone; 
the values of K with the strongest statistical support were K = 3 
and K = 5 (Figure 4A). However, the K = 5 structure seems bio- 
logically unrealistic, characterized by highly admixed individuals 
of the various "populations" (Figure 4B). Alternatively, the K = 3 
structure identified for the contact zone was virtually identical to 



the regional K = 3 structure (Figure 4C). In contrast to the contact 
zone, no fine- scale genetic structure was detected in Ohio or in the 
northeast zone. In Ohio, the value of K with the highest explanatory 
power was K = 1 (Figure 5A); in the northeast zone, the value of 
Ln P(D) does not increase beyond K = 2 and the change in Ln P(D) 
between K = 1 and K = 2 is minimal, indicating weak support for 
genetic structure (Figure 5B). 

f ^ 

Data of genetic diversity in nortlieastern coyotes 

2 Data Files 

Iittp://dx.doi.org/10.6084/m9.figsliare.943483 



Discussion 

This study documents the presence of fine- scaled population genetic 
structure in eastern coyotes. Specifically, the contact zone exhibits 
a strong signal of population structure, the same signal detected in 
the regional analysis. This pattern may reflect the recent merging of 





Figure 3. Principal components analysis of 16 autosomal SNPs genotyped in 247 eastern coyotes. Symbol colors correspond to K = 3 
STRUCTURE-inferred clusters with > 80% assignment, or black if highly admixed {i.e., no assignment > 80%), as in Figure 2C. Inset shows 
scree plot of eigenvalues. 
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Figure 4. Population genetic structure of eastern coyotes in the contact zone. A) Estimation of the number of genetic populations using 
two criteria: AK (solid line) and Ln P(D) (dashed line). The data strongly support K = 3 and K = 5. B) STRUCTURE bar plots of N = 11 8 coyotes 
subdivided into K = 3 and K = 5 genetic clusters. C) Spatial distribution of K = 3 genetic clusters. Symbol colors represent the genetic cluster 
with > 80% assignment from panel B, or black if highly admixed (i.e., no assignment > 80%). 
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Figure 5. Absence of fine-scale population genetic structure in A) 
N = 1 6 coyotes from Ohio, and B) N = 11 3 coyotes from the northeast 
zone. See Figure 1 for zone designations. 



two colonization fronts and the highly heterogeneous landscape of 
New York and Pennsylvania. A separate, spatially-explicit assess- 
ment with an independent set of SNPs demonstrated that northeast- 
ern coyotes exhibit a strong signal of population structure in Central 
New York^ Furthermore, the same investigation showed that popu- 
lation structure is partially explained by ecological factors, such as 
deer density and human land use^ Indeed, eastern coyotes living in 
areas of high deer density are genetically more wolf-like than those 
living in areas of low deer density 

In contrast to the contact zone, coyotes in Ohio appear to make up 
a single panmictic population, as do coyotes in the northeast zone. 
Given the high mtDNA and nuclear diversity observed in Ohio by 
Kays et aV and in this study, it is unlikely that the lack of genetic 
structure stems from a founder effect. The failure to uncover more 
than one genetic population in Ohio may be due instead to its land- 
scape homogeneity or to the lack of resolution afforded by 16 indi- 
viduals to detect finer levels of structure. On the other hand, the 
failure to uncover more than one genetic population in the northeast 
zone is surprising given the vast geographic area with its ecological 
heterogeneity and the large number of individuals sampled. Coyotes 
in the northeast zone are Hkely the descendants of a few founders and 
therefore do not exhibit a strong signal of population genetic structure. 

The perception of population structure, even in highly vagile ani- 
mals where it was least expected, has been refined by steadily 



improving molecular data and geographic sampling. Initially, using 
mtDNA restriction site polymorphisms and nuclear microsatel- 
lites, no evidence of population structure or isolation by distance 
was found in coyotes, even at the continental scale^^'^\ Various 
behavioral and historical explanations have been invoked to explain 
these early genetic patterns. But a more likely explanation is that 
the patterns of weak differentiation were artifacts of sparse geo- 
graphic sampling or poor resolution due to the use of few molecu- 
lar markers. More recent studies employing advanced analyses of 
spatial and genetic data have revealed strong differentiation among 
parapatric populations of coyotes and wolves, even in the absence 
of physical barriers to movement^ '^^"^l These investigations used 
multiple loci and dense geographic sampling to uncover cryptic 
genetic subdivisions. Strong genetic differentiation between adja- 
cent populations of coastal and inland wolves in British Columbia 
was shown with mtDNA^'^, demonstrating that fine- scale genetic 
differentiation can be detected with denser sampling alone, even 
using a single molecular mtDNA marker. Similar cryptic subdivi- 
sions have been discovered in several highly mobile groups, such as 
Lynx"^^'^^, ungulates'^^"'^'^, cetaceans^^'^^', and hawks"^^. In all these cases, 
genetic subdivisions appear to emerge from ecological factors 
and local foraging adaptations. Future studies should focus on the 
ecological mechanisms underlying the cryptic genetic structure in 
northeastern coyotes, especially because they have only inhabited 
the region for the last 30-80 years-. Confirming ecological deter- 
minants of population structure in the absence of obvious physical 
dispersal barriers would provide an interesting example of rapid 
ecological differentiation. 

There are some important similarities and discrepancies between 
mtDNA and autosomal SNP patterns. The data indicate that coyotes 
in Ohio are the most genetically diverse in the region when sur- 
veyed with nuclear SNPs, as with mtDNA\ However, the gradual 
eastward decay in genetic diversity observed with mtDNA is not 
replicated with nuclear SNPs. The marked reduction of heterozy- 
gosity in the contact zone and in the overall region is very likely 
caused by population structure, i.e., the Wahlund effect. This 
interpretation is supported by the increase in the number of loci in 
Hardy-Weinberg equilibrium at smaller geographic scales and by 
the congruent signal of population structure in the overall region 
and in the contact zone. In addition, the three primary populations 
detected in this study do not correspond to the three subdivisions 
inferred with mtDNA\ Together, these results suggest that studies 
based solely on mtDNA should be interpreted cautiously. For 
example, mtDNA sequence similarity suggested that a small popu- 
lation of Scandinavian wolves was founded by individuals released 
from Swedish zoos, but nuclear polymorphic markers falsified the 
release hypothesis and instead supported a hypothesis of natural 
immigration or expansion from an unknown relict wolf pack^l 
Discrepancies between patterns observed with mtDNA and nuclear 
DNA may be caused by true organismal processes, such as sex- 
biased dispersal^'^ However, there is no evidence for sex-specific 
dispersal behaviors in eastern coyotes, consistent with their monoga- 
mous breeding system^"'' ^ Alternatively, discrepancies may be 
caused by marker- specific phenomena such as effective population 
size, lineage sorting, mutation rate, and coalescent times^^, or the 
violation of certain assumptions of mtDNA inheritance, such as 
recombination, paternal leakage, and heteroplasmy^\ Future stud- 
ies should further evaluate these sources of discrepancies. 
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The present study underscores three related methodological issues 
that are of broad interest, especially as SNPs continue to be in vogue 
in population and wildlife genomics. First, these data confirm that 
SNPs discovered in a model organism are an appropriate tool to 
address various questions regarding the ecology and evolution of 
non-model relatives. The sequencing of the dog genome-^ quickly 
enabled SNP-based investigations into wild members of the family 
Canidae, including coyotes, wolves, jackals, and foxes^'^^"^^. Second, 
this study highlights the importance of evaluating the ascertainment 
bias of markers employed in a survey of genetic variation, especially 
in multi-species comparisons. Many SNP-based studies are not 
addressing the issue of ascertainment bias^^. In the present study, 
the genome- wide analysis of variation is dog-biased because SNPs 
were ascertained primarily from comparisons of boxer and poodle 
genomes after the completion of the dog genome project^ On the 
other hand, the high-heterozygosity SNP analysis is coyote-biased 
because the 16 SNPs were chosen from an ascertainment panel of 
northeastern coyotes. The bias in the latter case is not problematic 
because the ascertainment panel from which the loci were selected 
is representative of the whole population of northeastern coyotes. 
However, the diversity measures reported in this study should not 
be compared to similar measures from other populations, unless 
the comparison corrects for ascertainment bias^l Lastly, this study 
emphasizes the necessity of selecting SNPs very carefully to match 
the research question of interest. Here, a set of high-heterozygosity 
SNPs was interrogated in order to examine geographic patterns of 
genetic diversity and population structure. But other research ques- 
tions may require polymorphic markers with other properties. For 
example, in order to better understand the complex hybrid ancestry 
of the northeastern coyote, Monzon et al?^ used species-diagnostic 
SNPs to quantify the relative contributions of its parental popu- 
lations. In addition, recent advances in the molecular genetics of 



phenotypic traits in dogs allow the use of SNPs linked to genes of 
known function to address long-standing questions about morpho- 
logical, physiological, and behavioral adaptations in northeastern 
coyotes and other wild canids^'^'^. 

Data availability 

figshare: Data of genetic diversity in northeastern coyotes, http:// 
dx.doi.org/10.6084/m9.figshare.9434836o 
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First, thank you for the opportunity to review Javier IVIonzon's article, "First regional evaiuation ofnuciear 
genetic diversity and popuiation structure in norttieastern coyotes (Canis latrans)" 

I found this to be a well-written, engaging, and thoughtful manuscript with important contributions to the 
use of SNPs in studies of population structure and in elucidating the genetic structure of the northeastern 
coyote. The author adeptly targets a need for using multiple markers across multiple individuals for proper 
inference. The author furthermore supports the use of the canine SNP array in studies of coyotes, 
following the recommendation of vonHoldt etal. (201 1), while also asserting the need for addressing 
ascertainment bias prior to the use of SNPs in non-model organisms. 

I have no major reservations, rather a few minor comments regarding clarification of the methods along 
with some questions and considerations for future discussion and research. 

Methods: 

1 . With such a large sample of high quality tissue available for the analysis, why were scats (n=6) 
included in the analysis? Were the scats important for adequate sampling across the study area? 

2. Fourteen of vonHoldt etal's. (201 1) coyote specimens were included in the initial selection of the 
16 SNP panel used in subsequent analyses. Why were only 7 of vonHoldt etai.'s (201 1) samples 
included in the analysis of diversity and structure? 

Results: 

1 . Using the SNP panel, the author reports the observed and expected heterozygosity for the three 
putative population zones identified by Kays etal. (2010; Table 2). It would be interesting to see 
the Ho and He for the population clusters identify by the current STRUCTURE analysis (i.e., K =2 & 
K=3). 

2. What are the pairwise Fst values for two genetic clusters also supported by the STRUCTURE 
analysis? 

Discussion: 

1 . There is ample evidence of genetic structure; however, I'm not sure that I would agree with the 
author's suggestion that there is "some geographic structuring" at least based on Figure 2c. The 
author does a thorough job of listing the factors and processes that may be involved in explaining a 
lack of geographic structure, as well as the discrepancy between the current nuclear analysis of 
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population structure and the previous mtDNA analysis (Kays etal. 2010). 



• A possible further explanation for the lack of geographic structuring could be that a SNP 
panel - with such high heterozygosity - may actually reflect older mutations that have had a 
longer time to move around the landscape via dispersal (see Morin etal. 2004). This might 
allow one to detect patterns of genetic structure without any obvious geographic patterns. 

• The author also mentions, correctly, that based on the current state of knowledge, there is 
no sex-bias in coyote dispersal. I wonder if the author's data set could be used to 
investigate this by factoring in sex and rerunning the analyses presented in this manuscript. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 




Linda Rutledge 

Biology Department, Trent University, Peterborough, ON, Canada 



Approved: 01 May 2014 

Referee Report: 01 May 2014 

doi:1 0.5256/f 1 0OOresearch.381 9.r3944 

This is a very well-written, interesting article on utilizing highly-polymorphic SNPs to identify cryptic 
substructure in a population of eastern coyotes. The analyses are comprehensive (including Bayesian 
cluster analysis, principal components analysis, and Fst calculations) and the data do seem to support 
fine-scale population substructure. If it weren't for the extensive analyses I would find it very difficult to 
believe that 16 SNPs (even highly polymorphic ones) could find such fine-scale structure because it has 
been estimated that it take at least 15x more SNPs than microsatellites to identify structure in populations 
(HaasI & Payseur 201 1 , Heredity). Presumably the same clustering patterns would remain if the samples 
were analyzed at thousands of genome-wide SNPs that have not been selected for their polymorphism - 
i.e. presumably more, less polymorphic loci should show the same substructure. 

The biological/ecological reason for the clustering remains unclear (but that is clearly beyond the scope of 
the paper and is really quite difficult to tease apart). The author provides plausible explanations for the 
structure (e.g. "merging of the two colonization fronts"; possible deer or human density patters), but I 
wonder if social structure (i.e. family/pack groups) may contribute to the genetic patterns found. 

Overall, the manuscript reads very well and provides a rigorous analysis of a novel dataset that 
complements the authors other work. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
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Denmark 

Approved: 18 March 2014 

Referee Report: 18 March 2014 
doi:1 0.5256/f 1 0OOresearch.381 9.r3946 

Thank you for the opportunity to review this manuscript. Javier IVIonzon presents a timely and relevant 
study of population structure in Northeastern coyotes, which complements earlier work in this area and 
highlights new and intriguing research questions. The author builds on previous genomic analyses of 
canids with a high-density SNP chip yet a limited sample of Northeastern coyotes, and a study of 
northeastern coyotes that used several hundred samples but a single genetic marker. The panel of 
selected SNPs is biased toward the study population of Northeastern coyotes, but the author makes this 
very clear and includes an evaluation of ascertainment bias across canid groups included in the initial 
study with the high-density SNP chip from which this panel of 16 loci was selected. 

The article is well written, and the author has done a good job explaining the main limitations of the work 
and outlining why findings should be interpreted with caution. I have one suggestion on supplementary 
data that could be included for an additional perspective on genetic structure, and some minor comments 
on the manuscript text: 

1 . The author describes the SNP results relative to the areas of Ohio, the contact zone and the 
northeast zone from Kays etal. 2010, and in Table 2 notes that "most individuals genotyped at 16 
nuclear SNPs represent a subset of those individuals sequenced" for mtDNA. If individual mtDNA 
results are available for coyotes, it would be valuable to see the extent to which the mtDNA 
haplotypes classified as Great Lakes wolf (or eastern wolf), coyote and gray wolf from Kays etal. 
2010 (Fig 2) correspond with genetic structure based on SNPs.This would provide important 
information to help understand how similarities/discrepancies in genetic structure are distributed 
across space. 

2. The relatively low genetic diversity for coyotes in Table 3 and the results from vonHoldt etal. 201 1 
suggest a bias against coyotes, although coyotes may at least in some regions, have higher 
genetic diversity than wolves considering their range expansion and increase in population size. It 
might be worth noting that sequencing coyotes directly could provide a more in-depth view of 
genomic regions under selection. Although I agree with the author's conclusion about the utility of 
using a closely related and well-studied species, alternate methods such as 
genotyping-by-sequencing could be recommended as possibilities for future research. 

3. Introduction, 1 sentence "Historically restricted to the open deserts and plains...". Though the 
coyote is widely regarded as a species of western and central North America, it is difficult to 
exclude the possibility that it could also have been present farther east prior to European 
colonization and landscape modifications. 

4. Introduction, 3''^ paragraph: 

- The sentence "Population genetic structure plays considerable roles in evolution... the splitting of 
one species into two if the environments are markedly different". In addition to adaptation, it may 
be important also to mention genetic drift, and subsequent development of incompatibilities 
between isolated populations. 

- The sentence starting with "The detection of genetic structure largely depends on the type and 
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number of molecular markers examined": Here you may want to mention explicitly the temporal 
resolution of markers with different mutation rates (as you have done in the Discussion). 



- "SA/Ps have become a popular and Inexpensive tooF: perhaps ''Increasingly affordable" or similar 
would be better here? 

5. Discussion, 3''^ paragraph, sentence: "...cryptic genetic structure of Northeastern coyotes, 
especially as they have only Inhabited the region for 30-80 years". The author notes earlier in the 
text that coyotes in the northeast zone may descend from a few founders, which likely arrived via a 
northern route through Ontario, whereas a second colonization front occurred farther south through 
Ohio. Bierne etal. 201 1 proposed that intrinsic (environment-independent) genetic 
incompatibilities may coincide with ecological boundaries, as these environmental transition zones 
often "trap" the tension zones resulting from e.g. assortative mating. Such processes might be 
influencing the contact zone between the southern and northern colonization fronts, and interact 
with the selective pressures associated with forested, agricultural and urban habitat, and deer 
density, noted in Monzon 2012 (PhD Dissertation). 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
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